Finding Topic Words for Hierarchical Summarization (DRAFT)
نویسندگان
چکیده
! "$#% & ' ( *) ' ,+! ./. *0 ( *) ' ,+ 1"2 ! 3) / 4#% . *) ' ,5768 2) :91 9 :;< =" ?>1 ! ./. ( *) @ ) ./ A #B 94 ' C) D ' 1 ' E./ F"4 1"E ! =) G" > H) ' E) I49 = / ?;J) LKM N#% N 4) 0 . *) HOE *) ) '9 = FO2 9 9 HO4 / 9 0 ) ?) B ' H) .E+M;: < 9 9 I4 . ) ' #P) :QN . 0 ) SR4 )DTU ' .E5WV< X H) .Y Z/ M) HO[ *) ) ? ./ \ " &) = L 1 ]./ F" ^5U_X N './9 3) N ; ) ? KM `) a94 F E./ ?) F" 29 9 ' ! "[#% b ' C)! ) ) 9 ? G 1"4 D ! ! ./9 ) ' " I4 c ? !0 +' ,;< F ,;< "4 ,#d ' "] ! V\e-5 6fQ]e-5 gB 4 , ! H) h ! ; ) )G) & ;i) LKM &9 !#d ./ = ];< G = )!) ? ]) 1
منابع مشابه
The TITech Summarization System at TAC-2009
This paper presents the TITech summarization system participating in TAC2009. Specifically, we discuss our results for the Update track. We propose a new method for creating summaries by ordering sentences. After a draft summary is obtained, we conduct agglomerative hierarchical clustering on the sentences of the draft summary based on sentence associativity. Then we use a probabilistic method ...
متن کاملTopic Model Stability for Hierarchical Summarization
We envisioned responsive generic hierarchical text summarization with summaries organized by topic and paragraph based on hierarchical structure topic models. But we had to be sure that topic models were stable for the sampled corpora. To that end we developed a methodology for aligning multiple hierarchical structure topic models run over the same corpus under similar conditions, calculating a...
متن کاملDetection of Topic and its Extrinsic Evaluation Through Multi-Document Summarization
This paper presents a method for detecting words related to a topic (we call them topic words) over time in the stream of documents. Topic words are widely distributed in the stream of documents, and sometimes they frequently appear in the documents, and sometimes not. We propose a method to reinforce topic words with low frequencies by collecting documents from the corpus, and applied Latent D...
متن کاملAn Integrated Multi-document Summarization Approach based on Word Hierarchical Representation
This paper introduces a novel hierarchical summarization approach for automatic multidocument summarization. By creating a hierarchical representation of the words in the input document set, the proposed approach is able to incorporate various objectives of multidocument summarization through an integrated framework. The evaluation is conducted on the DUC 2007 data set.
متن کاملA Hybrid Hierarchical Model for Multi-Document Summarization
Scoring sentences in documents given abstract summaries created by humans is important in extractive multi-document summarization. In this paper, we formulate extractive summarization as a two step learning problem building a generative model for pattern discovery and a regression model for inference. We calculate scores for sentences in document clusters based on their latent characteristics u...
متن کامل